Skyline Computation on Commercial Data

نویسندگان

  • Michael Galli
  • Stefan Schnürle
  • Ruedi Arnold
  • Marc Pouly
چکیده

• Our data set contains data on 55208 cars [1]. • To each car, 23 attributes are assigned. – correlated (e.g., cylinders and engine size). – anti-correlated (e.g., mileage and registration date). – nearly independent (e.g., mileage and horsepower). • Outliers countervail correlation effects. • Cardinalities differ greatly, e.g.: – 5988 different values for attribute price. – only 17 different values for color. – only 6% of all cars are assigned a unique value for price.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adapting Skyline Computation to the MapReduce Framework: Algorithms and Experiments

This paper addresses the problem of skyline computation under the MapReduce framework. As a parallel programming model for data-intensive computing applications, MapReduce runs on a cluster of commercial PCs with the main idea of task decomposition and result reduction. Based on different data partitioning strategies, three MapReduce style skyline computation algorithms are developed: MapReduce...

متن کامل

Preference Analytics in EXASolution

Skyline queries and the more general concept of preferences are wellknown in the database community and there are many academic approaches for the computation of the best-matching objects. Furthermore, data analytics and multicriteria optimization play an important role in Business Intelligence where it facilitates optimal decision making. Preference Analytics is the combination of preferences ...

متن کامل

Catching the Best Views of Skyline: A Semantic Approach Based on Decisive Subspaces

The skyline operator is important for multicriteria decision making applications. Although many recent studies developed efficient methods to compute skyline objects in a specific space, the fundamental problem on the semantics of skylines remains open: Why and in which subspaces is (or is not) an object in the skyline? Practically, users may also be interested in the skylines in any subspaces....

متن کامل

Dissertation Defense Efficient and Adaptive Skyline Computation

Abstract: Skyline, also known as Maxima in computational geometry or Pareto in business management field, is important for many applications involving multi-criteria decision making. The skyline of a set of multi-dimensional data points consists of the points for which no other point exists that is better in at least one dimension and at least as good in every other dimension. Although skyline ...

متن کامل

Skyline: Stacking Optimal Solutions in Exact and Uncertain Worlds

In many applications involving multiple criteria optimal decision making, users may often want to make a personal trade-off among all optimal solutions for selecting one object that best fits their personal needs. As a key feature, skyline in a multi-dimensional space provides a minimal set of candidates for such purposes by removing every object that is not preferred by any (monotonic) utility...

متن کامل

Probabilistic Skyline Queries over Uncertain Moving Objects

Data uncertainty inherently exists in a large number of applications due to factors such as limitations of measuring equipments, update delay, and network bandwidth. Recently, modeling and querying uncertain data have attracted considerable attention from the database community. However, how to perform advanced analysis on uncertain data remains an interesting question. In this paper, we focus ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016